AITopics | pre-ranking model

Collaborating Authors

pre-ranking model

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

AIF: Asynchronous Inference Framework for Cost-Effective Pre-Ranking

Kou, Zhi, Sheng, Xiang-Rong, Han, Shuguang, Zhao, Zhishan, Cheng, Yueyao, Zhu, Han, Xu, Jian, Zheng, Bo

arXiv.org Artificial IntelligenceNov-21-2025

In industrial recommendation systems, pre-ranking models based on deep neural networks (DNNs) commonly adopt a sequential execution framework: feature fetching and model forward computation are triggered only after receiving candidates from the upstream retrieval stage. This design introduces inherent bottlenecks, including redundant computations of identical users/items and increased latency due to strictly sequential operations, which jointly constrain the model's capacity and system efficiency. To address these limitations, we propose the Asynchronous Inference Framework (AIF), a cost-effective computational architecture that decouples interaction-independent components, those operating within a single user or item, from real-time prediction. AIF reorganizes the model inference process by performing user-side computations in parallel with the retrieval stage and conducting item-side computations in a nearline manner. This means that interaction-independent components are calculated just once and completed before the real-time prediction phase of the pre-ranking stage. As a result, AIF enhances computational efficiency and reduces latency, freeing up resources to significantly improve the feature set and model architecture of interaction-independent components. Moreover, we delve into model design within the AIF framework, employing approximated methods for interaction-dependent components in online real-time predictions. By co-designing both the framework and the model, our solution achieves notable performance gains without significantly increasing computational and latency costs. This has enabled the successful deployment of AIF in the Taobao display advertising system.

artificial intelligence, computation, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2511.12934

Country:

North America > United States (0.28)
Europe > United Kingdom (0.28)

Genre: Research Report (1.00)

Industry:

Marketing (0.34)
Information Technology > Services (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Optimizing E-commerce Search: Toward a Generalizable and Rank-Consistent Pre-Ranking Model

Xu, Enqiang, Qiu, Yiming, Bai, Junyang, Zhang, Ping, Miao, Dadong, Wang, Songlin, Tang, Guoyu, Liu, Lin, Li, Mingming

arXiv.org Artificial IntelligenceMay-13-2024

Beyond these optimizations, meeting the system To enhance user experience and conversion efficiency, the online performance requirements presents a significant challenge. Contrasting search system is employed with a cascading architecture, mainly with existing industry works, we propose a novel method: a including recall and ranking. The ranking stage as the downstream Generalizable and RAnk-ConsistEnt Pre-Ranking Model (GRACE), component directly influences the efficiency of item sorting. Several which achieves: 1) Ranking consistency by introducing multiple superior ranking models have been identified in industrial research, binary classification tasks that predict whether a product is within such as MMoE [4], PLE [12], ESMM [5], DeepFM [1], DIN [18], the top-k results as estimated by the ranking model, which facilitates MIMN [8], SDIM [16], and SIM [12], with a focus on feature engineering, the addition of learning objectives on common point-wise behavioral sequence modeling, and objective function ranking models; 2) Generalizability through contrastive learning optimization. However, as the scale of products within the search of representation for all products by pre-training on a subset of system grows, there is an increasing demand for managing the ranking product embeddings; 3) Ease of implementation in feature time complexity of the sorting module.

proceedings, ranking model, representation, (11 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3626772.3661343

2405.05606

Country:

Asia > China > Beijing > Beijing (0.06)
North America > United States > District of Columbia > Washington (0.05)
North America > United States > Texas > Travis County > Austin (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.46)
Research Report > Promising Solution (0.34)

Industry: Information Technology > Services > e-Commerce Services (0.43)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.48)

Add feedback

Rethinking Large-scale Pre-ranking System: Entire-chain Cross-domain Models

Song, Jinbo, Huang, Ruoran, Wang, Xinyang, Huang, Wei, Yu, Qian, Chen, Mingming, Yao, Yafei, Fan, Chaosheng, Peng, Changping, Lin, Zhangang, Hu, Jinghe, Shao, Jingping

arXiv.org Artificial IntelligenceOct-12-2023

Industrial systems such as recommender systems and online advertising, have been widely equipped with multi-stage architectures, which are divided into several cascaded modules, including matching, pre-ranking, ranking and re-ranking. As a critical bridge between matching and ranking, existing pre-ranking approaches mainly endure sample selection bias (SSB) problem owing to ignoring the entire-chain data dependence, resulting in sub-optimal performances. In this paper, we rethink pre-ranking system from the perspective of the entire sample space, and propose Entire-chain Cross-domain Models (ECM), which leverage samples from the whole cascaded stages to effectively alleviate SSB problem. Besides, we design a fine-grained neural structure named ECMM to further improve the pre-ranking accuracy. Specifically, we propose a cross-domain multi-tower neural network to comprehensively predict for each stage result, and introduce the sub-networking routing strategy with $L0$ regularization to reduce computational costs. Evaluations on real-world large-scale traffic logs demonstrate that our pre-ranking models outperform SOTA methods while time consumption is maintained within an acceptable level, which achieves better trade-off between efficiency and effectiveness.

ecmm, pre-ranking system, proceedings, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3511808.3557683

2310.08039

Country: Asia > China > Beijing > Beijing (0.05)

Genre: Research Report (0.40)

Industry: Information Technology > Services (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

Add feedback

COPR: Consistency-Oriented Pre-Ranking for Online Advertising

Zhao, Zhishan, Gao, Jingyue, Zhang, Yu, Han, Shuguang, Lou, Siyuan, Sheng, Xiang-Rong, Wang, Zhe, Zhu, Han, Jiang, Yuning, Xu, Jian, Zheng, Bo

arXiv.org Artificial IntelligenceOct-9-2023

Cascading architecture has been widely adopted in large-scale advertising systems to balance efficiency and effectiveness. In this architecture, the pre-ranking model is expected to be a lightweight approximation of the ranking model, which handles more candidates with strict latency requirements. Due to the gap in model capacity, the pre-ranking and ranking models usually generate inconsistent ranked results, thus hurting the overall system effectiveness. The paradigm of score alignment is proposed to regularize their raw scores to be consistent. However, it suffers from inevitable alignment errors and error amplification by bids when applied in online advertising. To this end, we introduce a consistency-oriented pre-ranking framework for online advertising, which employs a chunk-based sampling module and a plug-and-play rank alignment module to explicitly optimize consistency of ECPM-ranked results. A $\Delta NDCG$-based weighting mechanism is adopted to better distinguish the importance of inter-chunk samples in optimization. Both online and offline experiments have validated the superiority of our framework. When deployed in Taobao display advertising system, it achieves an improvement of up to +12.3\% CTR and +5.6\% RPM.

consistency, pre-ranking model, proceedings, (15 more...)

arXiv.org Artificial Intelligence

2306.03516

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.40)

Industry:

Marketing (1.00)
Information Technology > Services (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Towards a Better Tradeoff between Effectiveness and Efficiency in Pre-Ranking: A Learnable Feature Selection based Approach

Ma, Xu, Wang, Pengjie, Zhao, Hui, Liu, Shaoguo, Zhao, Chuhan, Lin, Wei, Lee, Kuang-Chih, Xu, Jian, Zheng, Bo

arXiv.org Artificial IntelligenceMay-17-2021

In real-world search, recommendation, and advertising systems, the multi-stage ranking architecture is commonly adopted. Such architecture usually consists of matching, pre-ranking, ranking, and re-ranking stages. In the pre-ranking stage, vector-product based models with representation-focused architecture are commonly adopted to account for system efficiency. However, it brings a significant loss to the effectiveness of the system. In this paper, a novel pre-ranking approach is proposed which supports complicated models with interaction-focused architecture. It achieves a better tradeoff between effectiveness and efficiency by utilizing the proposed learnable Feature Selection method based on feature Complexity and variational Dropout (FSCD). Evaluations in a real-world e-commerce sponsored search system for a search engine demonstrate that utilizing the proposed pre-ranking, the effectiveness of the system is significantly improved. Moreover, compared to the systems with conventional pre-ranking models, an identical amount of computational resource is consumed.

architecture, efficiency, feature field, (12 more...)

arXiv.org Artificial Intelligence

2105.07706

Country:

North America > Canada (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > China (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.67)

Add feedback